[SPARK-22811][pyspark][ml] Fix pyspark.ml.tests failure when Hive is not available. #19997
MrBago wants to merge 1 commit into apache:master
Conversation
@HyukjinKwon I think you might have touched that code last.
Test build #84980 has finished for PR 19997 at commit
@MrBago, I think you can just skip when Hive support is disabled, if this matters. That test is valid only with Hive support.
```diff
 import numpy as np
 from numpy import abs, all, arange, array, array_equal, inf, ones, tile, zeros
 import inspect
+import py4j
```
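A minimal, self-contained illustration of why the import matters (no py4j required; the exception is simulated): Python resolves the exception name in an `except` clause only when an exception actually propagates, so referencing an unimported module there raises `NameError` instead of catching anything.

```python
def catch_with_missing_import():
    # Simulates `except py4j.protocol.Py4JError:` without `import py4j`:
    # the name `py4j` is looked up only when an exception reaches the
    # except clause, and that lookup itself fails with NameError.
    try:
        raise RuntimeError("raised inside the try block")
    except py4j.protocol.Py4JError:  # `py4j` was never imported here
        return "skipped"
```

Calling `catch_with_missing_import()` raises `NameError` (chained onto the original `RuntimeError`), which is exactly the failure mode the added import fixes.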
BTW, mind elaborating how importing this fixes an issue? It sounds orthogonal to me.
On the line below, we catch py4j.protocol.Py4JError so that we can raise SkipTest instead, but without the py4j import we get a NameError rather than skipping the test. Furthermore, because tearDownClass() then never runs, we leave behind stale state that causes other tests to fail. The except branch is only ever triggered in environments without Hive, where this test should be skipped.
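The skip pattern described above can be sketched with a small, self-contained stand-in (nothing here assumes pyspark or py4j is installed; `FakePy4JError` and `create_hive_session` are hypothetical stand-ins for `py4j.protocol.Py4JError` and the Hive-enabled SparkSession setup):

```python
import unittest


class FakePy4JError(Exception):
    """Stand-in for py4j.protocol.Py4JError."""


def create_hive_session():
    # Stand-in for building a Hive-enabled SparkSession; in an
    # environment without Hive classes, that raises a Py4JError.
    raise FakePy4JError("Hive classes are not found.")


class HiveContextSQLTests(unittest.TestCase):
    @classmethod
    def setUpClass(cls):
        try:
            cls.spark = create_hive_session()
        except FakePy4JError:
            # With the exception type importable, we can skip cleanly.
            # In the real suite, a missing `import py4j` made this
            # except clause raise NameError instead of skipping.
            raise unittest.SkipTest("Hive is not available")

    def test_runs_only_with_hive(self):
        self.assertIsNotNone(self.spark)
```

Because setUpClass raises SkipTest, unittest records the whole class as skipped without running (or leaking state from) any of its tests.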
Ah, it was my bad. Yup, you are right.
It was also written in the JIRA. Sorry, I just got up (I am in Korea :) ..) and rushed to leave some comments.
Merged to master.
What changes were proposed in this pull request?
pyspark.ml.tests is missing a py4j import. I've added the import and fixed the test that uses it. This test was failing only when running without Hive.
How was this patch tested?
Existing tests.